BIOTECHNOLOGY SYSTEMS BRANCH



RAW SEQUENCE LISTING 
ERROR REPORT 




The Biotechnology Systems Branch of the Scientific and Technical Information 
Center (STIC) detected errors when processing the following computer readable 
form: 

Application Serial Number: 09/893,666

Source: 

Date Processed by STIC: 12/06/2001



THE ATTACHED PRINTOUT EXPLAINS DETECTED ERRORS. 

PLEASE FORWARD THIS INFORMATION TO THE APPLICANT BY EITHER: 

1) INCLUDING A COPY OF THIS PRINTOUT IN YOUR NEXT COMMUNICATION TO THE 
APPLICANT, WITH A NOTICE TO COMPLY or, 

2) TELEPHONING APPLICANT AND FAXING A COPY OF THIS PRINTOUT, WITH A 
NOTICE TO COMPLY 

FOR CRF SUBMISSION QUESTIONS, PLEASE CONTACT MARK SPENCER, 703-308-4212. 



FOR SEQUENCE RULES INTERPRETATION, PLEASE CONTACT ROBERT WAX, 703-308-4216. 
PATENTIN 2.1 e-mail help: patin21help@uspto.gov or phone 703-306-4119 (R. Wax)
PATENTIN 3.0 e-mail help: patin3help@uspto.gov or phone 703-306-4119 (R. Wax)

TO REDUCE ERRORED SEQUENCE LISTINGS, PLEASE USE THE CHECKER
VERSION 3.0 PROGRAM, ACCESSIBLE THROUGH THE U.S. PATENT AND
TRADEMARK OFFICE WEBSITE. SEE BELOW:



Checker Version 3.0

The Checker Version 3.0 application is a state-of-the-art Windows-based software program
employing a logical and intuitive user interface to check whether a sequence listing is in
compliance with format and content rules. Checker Version 3.0 works for sequence listings
generated for the original version of 37 CFR §§1.821 - 1.825 effective October 1, 1990 (old
rules) and the revised version (new rules) effective July 1, 1998 as well as World Intellectual
Property Organization (WIPO) Standard ST.25.

Checker Version 3.0 replaces the previous DOS-based version of Checker, and is Y2K-compliant.
Checker allows public users to check sequence listings in Computer Readable Form
(CRF) before submitting them to the United States Patent and Trademark Office (USPTO).
Use of Checker prior to filing the sequence listing is expected to result in fewer errored sequence
listings, thus saving time and money.



Checker Version 3.0 can be downloaded from the USPTO website at the following address:

http://www.uspto.gov/web/offices/pac/checker



Raw Sequence Listing Error Summary 



ERROR DETECTED    SUGGESTED CORRECTION    SERIAL NUMBER: 09/893,666

ATTN: NEW RULES CASES: PLEASE DISREGARD ENGLISH "ALPHA" HEADERS, WHICH WERE INSERTED BY PTO SOFTWARE



1 Wrapped Nucleics

Wrapped Aminos

The number/text at the end of each line "wrapped" down to the next line. This may occur if your file
was retrieved in a word processor after creating it. Please adjust your right margin to .3; this will
prevent "wrapping."



Invalid Line Length    The rules require that a line not exceed 72 characters in length. This includes white spaces.
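
The line-length rule above lends itself to a simple check before submission. The following is a minimal sketch in Python, not the Checker program itself: the function name find_long_lines is hypothetical, and the only fact taken from this report is the 72-character limit, which counts white space.

```python
MAX_LINE_LENGTH = 72  # limit stated in the error summary; white space counts


def find_long_lines(text, limit=MAX_LINE_LENGTH):
    """Return (line_number, length) for every line longer than `limit`.

    Line numbers are 1-based. Line terminators are not counted.
    """
    problems = []
    for number, line in enumerate(text.splitlines(), start=1):
        if len(line) > limit:
            problems.append((number, len(line)))
    return problems


if __name__ == "__main__":
    sample = "<210> 1\n" + "a" * 73 + "\n" + "c" * 72
    for number, length in find_long_lines(sample):
        print(f"line {number}: {length} characters")
```

Running a check of this kind on the text file before copying it to disk would catch lines lengthened by a word processor.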



Misaligned Amino Numbering    The numbering under each 5th amino acid is misaligned. Do not use tab codes between numbers;
use space characters, instead.
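
One way to avoid misaligned numbering is to generate the number line programmatically, padding with spaces only. The sketch below is an illustration under stated assumptions: the function name format_protein is hypothetical, the row width of 16 residues matches the listing printed later in this report, and each residue is assumed to be a three-letter code followed by one space.

```python
RESIDUES_PER_ROW = 16  # assumed row width, matching the listing in this report
FIELD_WIDTH = 4        # three-letter code plus one separating space


def format_protein(residues):
    """Return alternating residue lines and number lines for a protein.

    Position 1 and every 5th position are numbered directly under the
    first letter of the corresponding residue, using spaces, never tabs.
    """
    lines = []
    for start in range(0, len(residues), RESIDUES_PER_ROW):
        row = residues[start:start + RESIDUES_PER_ROW]
        lines.append(" ".join(row))
        numbers = ""
        for offset in range(len(row)):
            position = start + offset + 1
            if position == 1 or position % 5 == 0:
                # Pad with spaces up to the column where this residue starts.
                numbers = numbers.ljust(offset * FIELD_WIDTH) + str(position)
        lines.append(numbers)
    return lines


if __name__ == "__main__":
    demo = ["Met", "Tyr", "Pro", "Glu", "Glu", "Ser", "Arg", "Gly", "Ser", "Gly"]
    print("\n".join(format_protein(demo)))
```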

Non-ASCII    The submitted file was not saved in ASCII (DOS) text, as required by the Sequence Rules. Please
ensure your subsequent submission is saved in ASCII text.
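
Whether a file is plain ASCII can be verified by scanning its bytes. This is a minimal sketch, assuming that "ASCII text" means every byte is in the 7-bit range; the function name find_non_ascii is hypothetical and the approach is not taken from the Checker program.

```python
def find_non_ascii(data: bytes):
    """Return (line_number, column, byte_value) for each byte above 0x7F.

    Line numbers and columns are 1-based. `data` is the raw file content.
    """
    hits = []
    for line_number, line in enumerate(data.splitlines(), start=1):
        for column, byte in enumerate(line, start=1):
            if byte > 0x7F:
                hits.append((line_number, column, byte))
    return hits


if __name__ == "__main__":
    raw = b"<210> 6\n<212> DNA \xe2\x96\xa0\n"
    for line_number, column, byte in find_non_ascii(raw):
        print(f"line {line_number}, column {column}: byte 0x{byte:02X}")
```

In practice the bytes would come from opening the listing in binary mode, for example open(path, "rb").read(), where path is the file about to be submitted.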

Variable Length    Sequence(s) ____ contain n's or Xaa's representing more than one residue. Per Sequence Rules,
each n or Xaa can only represent a single residue. Please present the maximum number of each
residue having variable length and indicate in the <220>-<223> section that some may be missing.

PatentIn 2.0 "bug"    A "bug" in PatentIn version 2.0 has caused the <220>-<223> section to be missing from amino acid
sequence(s) ____. Normally, PatentIn would automatically generate this section from the
previously coded nucleic acid sequence. Please manually copy the relevant <220>-<223> section to
the subsequent amino acid sequence. This applies to the mandatory <220>-<223> sections for
Artificial or Unknown sequences.



Skipped Sequences (OLD RULES)
Sequence(s) ____ missing. If intentional, please insert the following lines for each skipped sequence:

(2) INFORMATION FOR SEQ ID NO:X: (insert SEQ ID NO where "X" is shown)
(i) SEQUENCE CHARACTERISTICS: (Do not insert any subheadings under this heading)
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:X: (insert SEQ ID NO where "X" is shown)
This sequence is intentionally skipped

Please also adjust the "(iii) NUMBER OF SEQUENCES:" response to include the skipped sequences.



Skipped Sequences (NEW RULES)
Sequence(s) ____ missing. If intentional, please insert the following lines for each skipped sequence:

<210> sequence id number
<400> sequence id number
000

9 Use of n's or Xaa's (NEW RULES)
Use of n's and/or Xaa's has been detected in the Sequence Listing.
Per 1.823 of Sequence Rules, use of <220>-<223> is MANDATORY if n's or Xaa's are present.
In <220> to <223> section, please explain location of n or Xaa, and which residue n or Xaa represents.

10 Invalid <213> Response
Per 1.823 of Sequence Rules, the only valid <213> responses are: Unknown, Artificial Sequence, or
scientific name (Genus/species). <220>-<223> section is required when <213> response is Unknown or
is Artificial Sequence.

11 Use of <220>
Sequence(s) ____ missing the <220> "Feature" and associated numeric identifiers and responses.
Use of <220> to <223> is MANDATORY if <213> "Organism" response is "Artificial Sequence" or
"Unknown." Please explain source of genetic material in <220> to <223> section.
(See "Federal Register" 06/01/1998, Vol. 63, No. 104, pp. 29631-32) (Sec. 1.823 of Sequence Rules)

12 PatentIn 2.0 "bug"
Please do not use "Copy to Disk" function of PatentIn version 2.0. This causes a corrupted file,
resulting in missing mandatory numeric identifiers and responses (as indicated on raw sequence
listing). Instead, please use "File Manager" or any other manual means to copy file to floppy disk.
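
The <213> and <220>-<223> requirements described above can be approximated with a short script. This is a rough sketch and not the Checker program: the function names are hypothetical, the "Genus species" test is only a capitalization heuristic, and the keyword comparison is assumed to be case-insensitive because the listing in this report uses "ARTIFICIAL SEQUENCE" in capitals without being flagged.

```python
import re

TAG = re.compile(r"^\s*<(\d{3})>\s*(.*)$")
BINOMIAL = re.compile(r"^[A-Z][a-z]+ [a-z]+")  # heuristic for "Genus species"
NEEDS_FEATURE = {"artificial sequence", "unknown"}


def split_sequences(text):
    """Group numeric identifiers by sequence; a <210> line starts a new group."""
    groups = []
    for line in text.splitlines():
        match = TAG.match(line)
        if not match:
            continue  # sequence data or blank line
        tag, value = match.group(1), match.group(2).strip()
        if tag == "210":
            groups.append({})
        if groups:
            groups[-1].setdefault(tag, value)
    return groups


def check_organism(text):
    """Return (sequence id, message) pairs for <213>-related problems."""
    problems = []
    for fields in split_sequences(text):
        seq_id = fields.get("210", "?")
        organism = fields.get("213", "")
        # Tolerate the "ORGANISM:" alpha header inserted by PTO software.
        organism = re.sub(r"^ORGANISM:\s*", "", organism).strip()
        if organism.lower() in NEEDS_FEATURE:
            for tag in ("220", "223"):
                if tag not in fields:
                    problems.append((seq_id, f"<{tag}> missing"))
        elif not BINOMIAL.match(organism):
            problems.append((seq_id, f"invalid <213> response: {organism!r}"))
    return problems


if __name__ == "__main__":
    listing = "<210> 6\n<211> 20\n<212> DNA\n<213> ARTIFICIAL DNA\n<400> 6\n"
    print(check_organism(listing))
```

Applied to a sequence whose <213> response is "ARTIFICIAL DNA", the sketch reports an invalid <213> response. It does not try to reproduce every message the STIC software would emit for the same input.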



13 Misuse of n    n can only be used to represent a single nucleotide in a nucleic acid sequence. n is not used to represent
any value not specifically a nucleotide.

AMC/MH - Biotechnology Systems Branch - 08/21/2001 
RAW SEQUENCE LISTING DATE: 12/06/2001

PATENT APPLICATION: US/09/893,666 TIME: 10:36:14

Input Set : A:\210217US0.ST25.txt
Output Set: N:\CRF3\12062001\I893666.raw 

APPLICANT: YAMASHITA, ICHIRO 

TITLE OF INVENTION: High estrogen-sensitive medaka fish

FILE REFERENCE: 210217US-620-7249-0

CURRENT APPLICATION NUMBER: US 09/893,666 

CURRENT FILING DATE: 2001-06-29 

PRIOR APPLICATION NUMBER: JP 2000-247729 

PRIOR FILING DATE: 2000-08-17 

NUMBER OF SEQ ID NOS: 7

SOFTWARE: PatentIn version 3.1

SEQ ID NO: 1
LENGTH: 2764
TYPE: DNA
ORGANISM: Oryzias latipes
FEATURE:
NAME/KEY: CDS
LOCATION: (211)..(1935)
OTHER INFORMATION:
SEQUENCE: 1

31 gtctcgctgc tagatgcctg tcaggcaggc agagaggaag cagcccgtgt tgcgcagcac 60
33 atctgaggat gattcatgag taagagacag agctcggtgc agatcaggca gctgttcgga 120
35 ccagcactca gatccaggat cagcccagcc tcctcagagc tggagaccct ctccccacct 180
cgcctctcgc cccgtgaccc cctcggtgac atg tac cct gaa gag agc cgg ggt 234
38 Met Tyr Pro Glu Glu Ser Arg Gly
acttgttgaa catttgtgca cttgtccgtg 2565
205 tttttctagc actgtagctt atgaacgctg agaaagaatc taatgctttg atgcacagat 2625
207 ttgccttgta ttgtacatct cagccacaaa cgtacttttc gtccacaagt tgactgactg 2685
209 caccttgatt aaattgtcta aaagttcatt taaatgttga attctgtgaa aattaaaaag 2745
211 gcaattcctg tttctattt 2764

214 <210> SEQ ID NO: 2
215 <211> LENGTH: 575
216 <212> TYPE: PRT
217 <213> ORGANISM: Oryzias latipes
219 <400> SEQUENCE: 2



221 Met Tyr Pro Glu Glu Ser Arg Gly Ser Gly Gly Val Ala Ala Val Asp
222 1               5                   10                  15

225 Phe Leu Glu Gly Thr Tyr Asp Tyr Ala Ala Pro Asn Pro Ala Thr Thr
226             20                  25                  30

229 Pro Leu Tyr Ser Gln Ser Ser Thr Gly Tyr Tyr Ser Ala Pro Leu Glu
230         35                  40                  45

233 Thr Asn Gly Pro Pro Ser Glu Gly Ser Leu Gln Ser Leu Gly Ser Gly
234     50                  55                  60

237 Pro Thr Ser Pro Leu Val Phe Val Pro Ser Ser Pro Arg Leu Ser Pro
238 65                  70                  75                  80

241 Phe Met His Pro Pro Ser His His Tyr Leu Glu Thr Thr Ser Thr Pro
242                 85                  90                  95

245 Val Tyr Arg Ser Ser His Gln Gly Ala Ser Arg Glu Asp Gln Cys Gly
246             100                 105                 110

249 Ser Arg Glu Asp Thr Cys Ser Leu Gly Glu Leu Gly Ala Gly Ala Gly
250         115                 120                 125

253 Ala Gly Gly Phe Glu Met Ala Lys Asp Thr Arg Phe Cys Ala Val Cys
254     130                 135                 140

257 Ser Asp Tyr Ala Ser Gly Tyr His Tyr Gly Val Trp Ser Cys Glu Gly
258 145                 150                 155                 160

261 Cys Lys Ala Phe Phe Lys Arg Ser Ile Gln Gly His Asn Asp Tyr Met
262                 165                 170                 175

265 Cys Pro Ala Thr Asn Gln Cys Thr Ile Asp Arg Asn Arg Arg Lys Ser
266             180                 185                 190

269 Cys Gln Ala Cys Arg Leu Arg Lys Cys Tyr Glu Val Gly Met Met Lys
270         195                 200                 205

273 Gly Gly Val Arg Lys Asp Arg Ile Arg Ile Leu Arg Arg Asp Lys Arg
274     210                 215                 220

277 Arg Thr Gly Val Gly Asp Gly Asp Lys Val Val Lys Gly Gln Glu His
278 225                 230                 235                 240

281 Lys Thr Val His Tyr Asp Gly Arg Lys Arg Ser Ser Thr Gly Gly Gly
282                 245                 250                 255

285 Gly Gly Gly Gly Gly Gly Arg Leu Ser Val Thr Ser Ile Pro Pro Glu
286             260                 265                 270

289 Gln Val Leu Leu Leu Leu Gln Gly Ala Glu Pro Pro Ile Leu Cys Ser
290         275                 280                 285

293 Arg Gln Lys Leu Ser Arg Pro Tyr Thr Glu Val Thr Met Met Thr Leu
294     290                 295                 300

297 Leu Thr Ser Met Ala Asp Lys Glu Leu Val His Met Ile Ala Trp Ala
298 305                 310                 315                 320

301 Lys Lys Leu Pro Gly Phe Leu Gln Leu Ser Leu His Asp Gln Val Leu
302                 325                 330                 335

305 Leu Leu Glu Ser Ser Trp Leu Glu Val Leu Met Ile Gly Leu Ile Trp
306             340                 345                 350

309 Arg Ser Ile His Cys Pro Gly Lys Leu Ile Phe Ala Gln Asp Leu Ile
310         355                 360                 365

313 Leu Asp Arg Asn Glu Gly Asp Cys Val Glu Gly Met Thr Glu Ile Phe
314     370                 375                 380

317 Asp Met Leu Leu Ala Thr Ala Ser Arg Phe Arg Val Leu Lys Leu Lys
318 385                 390                 395                 400

321 Pro Glu Glu Phe Val Cys Leu Lys Ala Ile Ile Leu Leu Asn Ser Gly
322                 405                 410                 415

325 Ala Phe Ser Phe Cys Thr Gly Thr Met Glu Pro Leu His Asn Ser Ala
326             420                 425                 430

329 Ala Val Gln Ser Met Leu Asp Thr Ile Thr Asp Ala Leu Ile His Tyr
330         435                 440                 445

333 Ile Ser Gln Ser Gly Tyr Leu Ala Gln Glu Gln Ala Arg Arg Gln Ala
334     450                 455                 460

337 Gln Leu Leu Leu Leu Leu Ser His Ile Arg His Met Ser Asn Lys Gly
338 465                 470                 475                 480

341 Met Glu His Leu Tyr Ser Met Lys Cys Lys Asn Lys Val Pro Leu Tyr
342                 485                 490                 495

345 Asp Leu Leu Leu Glu Met Leu Asp Ala His Arg Leu His His Pro Val
346             500                 505                 510

349 Arg Ala Pro Gln Ser Leu Ser Gln Val Asp Arg Asp Pro Pro Ser Thr
350         515                 520                 525

353 Ser Ser Gly Gly Gly Gly Ile Ala Pro Gly Ser Ile Ser Ala Ser Arg
354     530                 535                 540

357 Gly Arg Ile Glu Ser Pro Ser Arg Gly Pro Phe Ala Pro Ser Val Leu
358 545                 550                 555                 560

361 Gln Tyr Gly Gly Ser Arg Pro Asp Cys Thr Pro Ala Leu Gln Asp
362                 565                 570                 575




365 <210> SEQ ID NO: 3
366 <211> LENGTH: 20
367 <212> TYPE: DNA
368 <213> ORGANISM: ARTIFICIAL SEQUENCE
370 <220> FEATURE:
371 <223> OTHER INFORMATION: SYNTHETIC DNA
373 <400> SEQUENCE: 3
374 tcggtgacat gtaccctgaa 20

377 <210> SEQ ID NO: 4
378 <211> LENGTH: 20
379 <212> TYPE: DNA
380 <213> ORGANISM: ARTIFICIAL SEQUENCE
382 <220> FEATURE:
383 <223> OTHER INFORMATION: SYNTHETIC DNA
385 <400> SEQUENCE: 4
386 cr.gtutgctc agtcttgaag 20

389 <210> SEQ ID NO: 5
390 <211> LENGTH: 20
391 <212> TYPE: DNA
392 <213> ORGANISM: ARTIFICIAL SEQUENCE
<210> 6
<211> 20
<212> DNA
<213> ARTIFICIAL DNA

<400> 6
gtaggaggtc ataaagaggg 20
L:404 M:220 C: Keyword misspelled or invalid format, <213> ORGANISM for SEQ ID#: 6

L:406 M:258 W: Mandatory Feature missing, <220> FEATURE:

L:406 M:258 W: Mandatory Feature missing, <223> OTHER INFORMATION:
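
The report lines above follow a regular pattern and can be split into fields for sorting or filtering. The sketch below is illustrative only. The meaning of each field is an assumption inferred from the three lines shown here ("L:" as a line number in the raw listing, "M:" as a message code, and a single letter such as "C" or "W" as a severity); the report itself does not define them, and the names ReportEntry and parse_entry are hypothetical.

```python
import re
from typing import NamedTuple, Optional


class ReportEntry(NamedTuple):
    line: int        # value after "L:" (assumed: line number in the raw listing)
    message_id: int  # value after "M:" (assumed: message code)
    severity: str    # single letter such as "C" or "W" (meaning assumed)
    text: str        # remainder of the line


ENTRY = re.compile(r"^L:(\d+)\s+M:(\d+)\s+([A-Z]):\s*(.*)$")


def parse_entry(raw: str) -> Optional[ReportEntry]:
    """Parse one report line; return None if it does not match the pattern."""
    match = ENTRY.match(raw.strip())
    if match is None:
        return None
    line, message_id, severity, text = match.groups()
    return ReportEntry(int(line), int(message_id), severity, text.strip())


if __name__ == "__main__":
    entry = parse_entry("L:406 M:258 W: Mandatory Feature missing, <220> FEATURE:")
    print(entry)
```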